Phonetic effects on listener detection of vowel concatenation
نویسنده
چکیده
Concatenative speech synthesis quality depends in part on the minimization of audible discontinuities between two successive concatenated units. This study focuses on human detection of concatenation discontinuities in synthetic speech. Statistical analyses compared for various phonetic categories the results observed in perceptual tests with two voices – one female and one male. Neither a comprehensive phonetic analysis nor a comparison of discontinuity detection between voices has been reported previously. Although discontinuities were generally more detectable for the female than the male voice, there were many similarities between results obtained from the two speakers. A reliably higher rate of detection of discontinuities was observed for diphthongs than for monophthong vowels. Post-vocalic consonants influenced concatenation discontinuities significantly more than pre-vocalic consonants, and postvocalic sonorants were associated with higher detection rates than post-vocalic non-sonorants. The differences in discontinuity detection among vowels and among consonantal contexts for both voices consistently suggest that highly audible discontinuity is related to concatenation in regions of spectral change.
منابع مشابه
Patterns of individual differences in reduction: Implications for listener-oriented theories
Many listener-oriented theories of phonetic reduction assume that the talker has tacit knowledge of an interlocutor’s mental state, and consequently predict that talkers with poor theory of mind should exhibit inconsistent behavior between semantic contexts. This study examined effects of individual differences in theory of mind skills on extent of phonetic reduction in three acoustic domains (...
متن کاملThe Effect of English Vowel-Recognition Training on Beginner and Advanced Iranian ESL Learners
This study was an attempt to investigate the effect of vowel-recognition training on beginner and advanced Iranian ESL learners. A total of 36 adult Iranian ESL learners (18 advanced and 18 beginners) who were students of various majors at Memorial University (MUN) were recruited for the study. Advanced participants had the experience of living in Canada for at least three years while beginners...
متن کاملCoarticulatory influences on the perceived height of nasal vowels.
Certain of the complex spectral effects of vowel nasalization bear a resemblance to the effects of modifying the tongue or jaw position with which the vowel is produced. Perceptual evidence suggests that listener misperceptions of nasal vowel height arise as a result of this resemblance. Whereas previous studies examined isolated nasal vowels, this research focused on the role of phonetic conte...
متن کاملEnglish L2 Vowel Acquisition over Seven Years
Although cross-sectional research designs have been widely used in the evaluation of L2 phonetic learning, longitudinal studies of L2 speech production are rare. As a result, it is difficult to draw strong conclusions about the effects of language experience on L2 phonetic acquisition. This investigation of adult Slavic (Russian and Ukrainian) and Mandarin speakers tracks their English high vow...
متن کاملData-driven perceptually based join costs
Concatenative speech synthesis systems attempt to minimize audible discontinuities between two successive concatenated units. In unit selection concatenative synthesis, a join cost is calculated that is intended to predict the extent of audible discontinuity introduced by the concatenation of two specific units. A study was conducted that used human perceptual data on the detectability of mid-v...
متن کامل